Fast Sound Source Localization Based on SRP-PHAT Using Density Peaks Clustering

Abstract

Sound source localization has been increasingly used in recent years. Among the existing sound source localization techniques, steered response power–phase transform (SRP-PHAT) offers considerable advantages in robustness to noise and reverberation. In real-time applications, however, its heavy computational load makes it impossible to localize a source within a reasonable time, because SRP-PHAT employs a grid search scheme. To solve this problem, the authors propose an improved procedure called ODB-SRP-PHAT, i.e., steered response power–phase transform with an offline database (ODB). The basic idea of ODB-SRP-PHAT is to determine the possible sound source positions using density peak clustering before real-time localization, and to store the identified positions in an ODB. Then, at the online positioning stage, only the power values of the positions in the ODB are calculated. In real-time monitoring, e.g., locating the speaker in a video conference, the computational load of ODB-SRP-PHAT is significantly smaller than that of SRP-PHAT. Simulations and experiments in a real environment verified the high localization accuracy and small computational load of ODB-SRP-PHAT. In addition, the robustness to noise and reverberation was retained. The suggested procedure displayed good applicability in a real environment.
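The two-stage idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a set of offline position estimates is already available, uses a Gaussian-kernel variant of density peaks clustering to pick the database entries, and treats the SRP-PHAT power computation as a caller-supplied function. The names `density_peak_centers`, `localize_online`, `d_c`, and `power_fn` are hypothetical.

```python
import math


def density_peak_centers(points, d_c, n_centers):
    """Offline stage (sketch): pick n_centers density peaks as ODB entries.

    points    -- list of coordinate tuples (assumed offline position estimates)
    d_c       -- cutoff distance controlling the density kernel width (assumed)
    n_centers -- number of candidate positions to keep in the database
    """
    n = len(points)
    # Pairwise Euclidean distances between all offline estimates.
    dist = [[math.dist(p, q) for q in points] for p in points]
    # Local density rho: Gaussian-kernel neighbour count, excluding the point itself.
    rho = [sum(math.exp(-(d / d_c) ** 2) for d in row) - 1.0 for row in dist]
    # delta: distance to the nearest point with strictly higher density;
    # the globally densest point gets its largest distance instead.
    delta = []
    for i in range(n):
        higher = [dist[i][j] for j in range(n) if rho[j] > rho[i]]
        delta.append(min(higher) if higher else max(dist[i]))
    # Cluster centers combine high density with large separation (rho * delta).
    order = sorted(range(n), key=lambda i: rho[i] * delta[i], reverse=True)
    return [tuple(points[i]) for i in order[:n_centers]]


def localize_online(odb, power_fn):
    """Online stage (sketch): evaluate power only at the stored ODB positions.

    power_fn is a placeholder for an SRP-PHAT power computation on the
    current audio frame; it is not implemented here.
    """
    return max(odb, key=power_fn)
```

With this structure, the online cost scales with the number of database entries rather than with the size of a full search grid, which is the saving the abstract describes. How the offline estimates are produced and how `d_c` and `n_centers` are chosen are left open in this sketch.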

Similar resources

Combining SRP-PHAT and two Kinects for 3D Sound Source Localization

The Kinect™ has been developed to recognize gestures and voice commands through a set of cameras and microphones, respectively. This paper proposes and evaluates a low-cost Sound Source Localization (SSL) solution based on this off-the-shelf equipment. It consists of employing a pair of Kinect devices as an alternative to a microphone array, and executing the Steered Response Power using the PHAse T...

A GPU Implementation of the SRP-PHAT Sound Source Localization Algorithm

Microphone arrays have been widely used for hands-free speech acquisition systems such as teleconferencing or automatic speech recognition. They typically require sound source localization for subsequent multichannel signal processing algorithms such as beamforming for enhancing speech signals. Steered response power with phase transform (SRP-PHAT) source localization has been the most popular m...

Comparison of SRP-PHAT and Multiband-PoPi Algorithms for Speaker Localization Using Particle Filters

The task of localizing single and multiple concurrent speakers in a reverberant environment with background noise poses several problems. One of the major problems is the severe corruption of the frame-wise localization estimates. To improve the overall localization accuracy, we propose a particle-filter-based tracking algorithm using the recently proposed Multiband Joint Position-Pitch (M-PoPi)...

Model-based Clustering of DOA Data Using von Mises Mixture Model for Sound Source Localization

In this paper, we propose a probabilistic framework for model-based clustering of direction of arrival (DOA) data to obtain stable sound source localization (SSL) estimates. Model-based clustering has been shown capable of handling highly overlapped and noisy datasets, such as those involved in DOA detection. Although the Gaussian mixture model is commonly used for model-based clustering, we pr...

Sound Source Localization Using a Pinna-based Profile Fitting Method

In a two-microphone approach, interaural differences in time (ITD) and interaural differences in sound intensity (IID) have generally been used for sound source localization. But those cues are not effective for vertical localization in the median plane (direct front). For that purpose, spectral cues based on features of head-related transfer functions (HRTF) have been investigated, but they ar...

Journal

Journal title: Applied Sciences

Year: 2021

ISSN: 2076-3417

DOI: https://doi.org/10.3390/app11010445